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Abstract 

A celebrated theorem of Shoda states that over any field K (of characteristic 0), 
every matrix with trace can be expressed as a commutator AB — BA, or, equivalently, 
that the set of values of the polynomial f{x,y) — xy — yx on MIyj(-/^) contains all 
matrices with trace 0. We generalize Shoda's theorem by showing that every nonzero 
multilinear polynomial of degree at most 3, with coefficients in K, has this property. 
We further conjecture that this holds for every nonzero multilinear polynomial with 
coefficients in K of degree m, provided that m — 1 < n. 
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1 Introduction 

We begin by recalling a theorem, which was originally proved for fields of characteristic 
by Shoda [12j and later extended to all fields by Albert and Muckenhoupt [1]. 

Theorem 1 (Shoda/Albert/Muckenhoupt). Let K he any field, n > 2 an integer, and 
M e Mn{K). If M has trace 0, then M = AB - BA for some A, B e Mn{K). 

For any A,B & M„(i^) the traces of AB and BA are equal, and therefore any matrix 
that can be expressed as a commutator {AB — BA) must have trace 0. (This observation 
is perhaps what prompted Shoda to prove the above result.) Thus, using \M.n{K),M.n{K)] 
to denote the i^-subspace of M.n{K) consisting of the matrices of trace 0, and f(M.n{K)) to 
denote the set of values of a polynomial / on M„(i^), the above theorem can be rephrased 
as follows. 

Corollary 2. Let K be any field, n > 2 an integer, and f{x,y) = xy—yx. Then f {M.n{K)) = 

It is therefore natural to ask what other polynomials / have the property f{M.n{K)) = 
[MniK),Mn{K)], or more generally, f(Mn{K)) D [M„(ir), M„(ir)]. In this note we conjec- 
ture that this is the case for all nonzero multilinear polynomials / (i.e., polynomials that 
are linear in each variable), provided that the degree of / is at most n + 1. We then prove 
this conjecture for multilinear polynomials of degree at most 3, over fields with at least n 
elements, and characterize the polynomials that satisfy f{Wln{K)) = [M„(K), M„(i^)] in 
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this situation. We also prove that for any unital ring i?, the set of values of the polynomial 
xzy — xyz + yzx — zyx on M„(i?) contains all matrices with the property that the sums of 
the elements along the diagonals above (and including) the main diagonal are 0. 

Since [M„(i^), M„(i^')] is a i^-subspace of M.n{K), our conjecture and result above can 
be viewed as a partial answer to a more specific version of the following question of Lvov [H 
Entry 1.98], which also has been attributed to Kaplansky (see [6]). 

Question 3 (Lvov). Let f be a multilinear polynomial over a field K . Is the set of values 
of f on the matrix algebra M„(il') a vector space? 

Kanel-Belov, Malev, and Rowen [6j have answered this question in the case where n = 2 
and K is quadratically closed (that is, every non-constant quadratic polynomial over K has 
a root in K). More specifically, they showed that in this case, the image of a multilinear 
polynomial must be one of 0, i^' ■ I2 (where I2 is the identity matrix), \M.2{K) ,'M.2{K)\, or 
M2{K). 

Other special cases of Question [3] have been explored elsewhere. For instance, Khurana 
and Lam [HI Corollary 3.16] showed that the set of values of the multilinear polynomial 
xyz — zyx on M„(K), over an arbitrary field K, is all of M„(ii'), when n > 2 (along 
with more general versions of this result for other rings R in place of the field K). Also, 
Kaplansky [TJ Problem 16] asked whether there exists a nonzero multilinear polynomial over 
a field K which always takes values in the center of M„(i^'), or, equivalently, whether there 
is such a polynomial whose set of values on M„(i^') is the center of M„(i^'). (This question 
arose in the study of polynomial identities on rings.) Such polynomials have indeed been 
constructed by Formanek [5J and Razmyslov [lOj . 

All the results and questions mentioned above are stated for multilinear (rather than 
arbitrary) polynomials, since in general, the set of values of a polynomial on M„(/r) is not 
a subspace. More specifically, Chuang [3] showed that if K is finite, then every subset S of 
M„(ir) that contains and is closed under conjugation is the image of some polynomial. Of 
course, such subsets S of M„(_ft') are generally not i^-subspaces. Let us also give an example 
of a (non- multilinear) polynomial / and an infinite field K such that /(M„(i^)) is not a 
fT-subspace of Mn{K). 

Example 4. Let K be an algebraically closed field, n > 2, and f{x) = x". For all distinct 
1 < i, j < n we have /(-Em) = En and f{Eii + Eij) = En + Eij, where Eij are the matrix 
units. Hence the i^-subspace generated by /(M„(if)) contains all E^j, and therefore must 
be M.n{K). Thus, to conclude that /(M„(_ft')) is not a subspace, it suffices to show that 
f(M„{K)) ^ M„(i^). Let A e M„(i^) be a nonzero nilpotent matrix. Since is the only 
eigenvalue of A, the Jordan canonical form of A must be strictly upper-triangular, from 
which it follows that A" = 0. Thus, if A = f{B) = B"" for some B E M^iK), then B must 
be nilpotent, and hence f{B) = = (by the above argument), contradicting A ^ 0. 
Therefore, f{M.n{K)) contains no nonzero nilpotent matrices, and hence f(Mn{K)) is not a 
subspace of M„(K). □ 

We conclude the Introduction by mentioning that there has been interest in generalizing 
Theorem [1] in other directions, particularly, in determining whether the same statement 
holds over an arbitrary (unital) ring. That is, given a ring R, can every matrix M G M„(i?) 
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having trace be expressed as a commutator M = AB — BA, for some A,B & M„(i?)? 
Intriguingly, this question remained open until 2000, when Rosset and Rosset [11] produced 
a ring R and a matrix M e M2(-R) having trace that cannot be expressed as a commutator. 
On the other hand, it turns out that over any unital ring R and for any n >2, every matrix 
M G M„(i?) having trace can be expressed as a sum of two commutators (see jHl Theorem 
15]). To the best of our knowledge, however, there is still no classification of the rings R with 
the property that every matrix with trace is a commutator in M„(i?), despite continued 
attention to the question. 

2 Preliminaries 

Let us next define multilinear polynomials more rigorously, and then collect some basic 
facts about them. 

Definition 5. Given a field K and a positive integer m, we denote by K{xi, . . . ,Xm) 
the K-algehra freely generated by the {non- commuting) variables xi, . . . ,Xm- A polynomial 
/(xi, . . . , Xm) e K{xi, . . . , Xm) is Said to be multilinear (of degree m) if it is of the form 

f{Xi, . . . , Xm) = Cto-2;o-(l)Xo-(2) • • • Xo-{m)y 

cr&Sm 

where Sm is the group of all permutations o/ {1, . . . , m} and G K . 

Some authors call these polynomials homogeneous multilinear. The motivation for calling 
such polynomials "multilinear" is that given any fT-algebra A, any element /(xi, . . . ,Xm) G 
K{xi, . . . , Xm) can be viewed as a map f : A x ■ ■ ■ x A ^ Ahj evaluating the Xi on elements 
of A, and it is easy to see that the polynomials of the above form are precisely the ones that 
give rise to maps f : A x ■ ■ ■ x A ^ A that are linear in each variable (for every if-algebra 
A). That is, maps / such that for all a, 6 G i^, j G {1, . . . , m}, and ri, . . . , r^, r'j G A, we 
have 

/(ri, . . . , rj_i, avj + br'j, r^+i, . . . , r^) 
= a/(ri, . . . , rj_i, r^, r^+i, . . . , r^) + 6/(ri, . . . , r^.i, r^., r^+i, . . . , r^). 

Given a polynomial f{xi, . . . ,Xm) G K{xi, . . . ,Xm) and i^'-algebra A, we set f{A) = 
{/(ri, . . . , rm) I ri, . . . ,rm G A}. Also, for any associative ring R we shall denote by [r,p] 
the commutator rp — pr of r,p E R, denote by [R, R] the additive subgroup of R generated 
by the commutators, and let [r, R] = {[r,p] | p G R} for r E R. 

In the next lemma we record a couple basic facts about multilinear polynomials for later 
reference. Both claims follow immediately from Definition [5l 

Lemma 6. Let K be a field, A a K-algebra, m a positive integer, and f{xi, . . . ,Xm) £ 
K{xi, . . . ,Xm) CL multilinear polynomial. Also, let ri,...,rm G A be arbitrary elements. 
Then the following hold. 

(1) For any a E K we have a/(ri, . . . , r^) = /(ari, . . . , r^). 

(2) For any invertible p E A we have pf{ri, . . . ,rm)p^^ = f{prip~^, . . . ,prmP~^)- 
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The next lemma, which is an easy consequence of Theorem [H describes the sets of values 
of multilinear polynomials of degree at most 2 on matrix algebras. 

Lemma 7. Let K he a field, m a positive integer, and f{xi, . . . ,Xm) € K{xi, . . . ,Xm) a 
multilinear polynomial. 

(1) Ifm = l, thenfiMn{K)) G {Q,Mn{K)}. 

(2) Ifm = 2, then /(M„(ir)) G {0, [M„(ir), M„(ir)], M„(ir)}. 

Proof. If m = 1, then / must be of the form f{x) = ax for some a E K, from which (1) 
follows. 

If m = 2, then / must be of the form f{x, y) = axy + byx for some a,b E K. If a + b ^ 0, 
then = (a + b)Y for all Y G Mn{K), which implies that /(M„(K)) = M„(ir). 

While, if a + 6 = 0, then f{x, y) = a{xy — yx), which has the same image as the polynomial 
xy — yx, as long as a 7^ 0, by Lemma [6](1). Statement (2) now follows from Corollary [2l □ 

The following fact will be useful in subsequent arguments. For a polynomial /(xi, . . . , Xm) £ 
K{xi, . . . , Xm) and 1 < / < m we view f{xi, . . . , x;, 1, . . . , 1) as a polynomial in K{xi, . . . ,xi). 

Corollary 8. Let K be a field, m a positive integer, and 

f{Xi, Xm) = ^ aaXa{l)XcT{2) ■ ■ ■ ^a{rn) ^ K{xi, ...,Xm) 

a multilinear polynomial. 

(1) If E.e5,„ ^ 0' ihen /(M„(i^)) = M^{K). 

(2) Ifm>2 andf{xi,X2,l,...A)^0, then f{M^{K)) G {[M„(ir), M„(i^)], M„(ir)}. 

Proof. This follows from Lemma [7] upon noting that /(xi, !,...,!) = (Xlo-eS'm 

/(xi, X2, 1, . . . , 1) = bxiX2 + CX2X1, for some b,c E K. □ 

3 A conjecture 

The main goal of this section is to justify a conjecture regarding the set of values of certain 
multilinear polynomials. We shall first require the following result of Amitsur and Rowen [21 
Proposition L8] to prove a fact about the linear spans of the images of our polynomials. 

Proposition 9 (Amitsur/Rowen). Let D be a division ring, n > 2 an integer, and A G 
Wln{D). Then A is similar to a matrix in M„(D) with at most one nonzero entry on the 
main diagonal. In particular, if A has trace zero, then it is similar to a matrix in M„(D) 
with only zeros on the main diagonal. 

Proposition 10. Let K be a field, n > 2 and m > 1 integers, and f{xi, . . . ,Xm) ol nonzero 
multilinear polynomial in K{xi, . . . ,Xm)- If n >m — l, then the K-subspace (/(M„(ir))) of 
M,^{K) generated by fiM^^K)) contains [M„(is:), M„(ir)] . 
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Proof. Write 

/(Xi, . . . , ^ ^ Ota-^ <j{2) ■ ■ ■ ■^cr{m)i 

for some a„ G K. Since / is nonzero, upon relabeling the variables if necessary, we may as- 
sume that ai ^ 0. Furthermore, by LemmaH if m < 2, then [Mn{K),Mn{K)] C /(M„(ir)) 
for any n > 2. We therefore may assume that m > 3. 

Let i,j G {1, . . . ,n} be distinct. Since n > m — 1 > 2, we can find distinct elements 
h, . . . , lm-3 € {1, • • • , ''^j \ {hj}- Letting Eki denote the matrix units, we then have 

f{Eii, Eij, Eji^,Ei^i2, . . . , Ei^_^i^_^, Ei^_^j) 

= aiEnEijEji^Ei^i^ . . . Ei^_^i^_^Ei^_^j + = aiEij, 

since multiplying En, Eij, Eji^, Ei^i^, . . . , Ei^_^i^_.^, Ei^^^j in any other order yields 0. (If 
m = 3, then we interpret the above equation as f{Eii,Eij,Ejj) = aiEij.) Therefore, Eij G 
f{M.n{K)) for all distinct i and j, and hence (/(M„(i^))) contains all matrices with zeros 
on the main diagonal. Now, Lemma [6] implies that (/(M„(i^))) is closed under conjugation. 
Hence, by Proposition El we have [M„(i\:), M„(if )] C (/(M„(ir))). □ 

The claim in the above proposition does not in general hold when n < m—1. For example, 
it is known that the set of values of the polynomial {xy — yxY on 'Ni2{K) is contained in the 
center K ■ I2 (see [7]). The same is true of the linearization 

/(xi,x2,i/i,?/2) = [xi,yi][x2,y2] + [xi,y2][x2,yi] + [x2,yi][xi,y2] + [x2,y2][xi,yi] 

of this polynomial. Thus, {f{M2{K))) = K ■ h ^ [M2{K),M2{K)]. 

An affirmative answer to Question [3] would imply that in the above proposition, if n > 
m - 1, then /(M„(Js:)), and not just (/(M„(ir))), contains [M„(i^), M„(K)]. Since it is 
suspected that the question does have an affirmative answer (e.g., see [B]), we make the 
following conjecture. 

Conjecture 11. Let K be a field, n > 2 and m > 1 integers, and f{xi, . . . ,Xm) a nonzero 
multilinear polynomial in K{xi, . . . ,Xm)- If n > m — 1, then /(M„(K)) 3 [M„(_ft'), M„(K)] . 

Lemma [7] shows that this conjecture holds for m < 3. In the next section we shall show 
that it holds for m = 3 as well, if K has at least n elements. 

4 The three-variable case 

We shall require another fact proved by Amitsur and Rowen [51 Lemma 1.2]. 

Lemma 12 (Amitsur /Rowen). Let K he a field and n > 2 an integer. Suppose that A = 
(aij) G M.n{K) is a diagonal matrix with an 7^ ajj fori 7^ j. Then [A, M„(ir)] consists of 
all the matrices with only zeros on the main diagonal. 

We are now ready for our main result. 
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Theorem 13. Let n > 2 be an integer, K a field with at least n elements, and f G K{x, y, z) 
any nonzero multilinear polynomial. Then /(M„(K)) contains every matrix having trace 0. 

Proof. If / has degree at most 2, then this foUows from Lemma [71 Thus, let us assume that 
the degree of / is 3. We can then write 

f{x, y, z) = axyz + bxzy + cyxz + dyzx + ezxy + gzyx (a, 6, c, d,e, g & K). 

lia + b + c + d + e + g^O, then [M„(K), M„(ir)] C /(M„(K)), Corollary [8](1). Let us 
therefore suppose that a + b + c + d + e + g = 0. In this case 

f{x, y, z) = a{xyz — zyx) + b{xzy — zyx) + c{yxz — zyx) + d{yzx — zyx) + e{zxy — zyx). 

Moreover, if any of f{l,y,z), f{x,l,z), or f{x,y,l) are nonzero, then by Corollary [8](2), 
[Mn{K),Mn{K)] C f{Mn{K)). Thus, let us assume that 

= /(I, z) = a{yz - zy) + c{yz - zy) + d{yz - zy) = {a + c + d){yz - zy), 

which implies that Q = a + c + d. Setting = /(a;, 1, z) and = /(a;, y, 1) we similarly get 
= a + 6 + c and = a + 6 + e. Solving the resulting system of equations gives b = d, c = e, 
and a = — 6 — c. Therefore, 

f{x, y, z) = {—b — c){xyz — zyx) + b{xzy — zyx + yzx — zyx) + c{yxz — zyx + zxy — zyx) 

= b{xzy — zyx + yzx — xyz) + c{yxz — zyx + zxy — xyz) 

= b{x[z,y] - [z,y]x) + c{z[x,y] - [x,y]z) = b[x, [z,y]] + c[z, [x,y]]. 

Now, since f{x,y,z) ^ 0, renaming the variables, if necessary, we may assume that 6 7^ 0. 
Then, by Lemma [6]^1), f{x,y,z) and b~^f{x,y,z) have the same set of values. Therefore, 
we may assume that f{x, y, z) = [x, [z, y]] + b[z, [x, y]] for some b E K. 

Let A e Mn{K) be a matrix having trace 0. We wish to show that A G /(M„(K)). By 
Proposition [9l A is conjugate to a matrix A' G M.n{K) with only zeros on the main diagonal. 
Hence, by Lemma [6](2), to conclude the proof it is enough to show that that A' G f(Mn{K)). 
By our assumption on K, we can find a diagonal matrix M G M„(i^) with distinct elements 
of K on its main diagonal. By Lemma fT2l there is some B G M.n{K) such that A' = [M, B]. 
Now, write B = C + D, where C has only zeros on the main diagonal and D is diagonal. 
Then 

A' = [M, B] = [M, C + D] = [M, C] + [M, D] = [M, C], 

since M commutes with all diagonal matrices. Using Lemma [12] once again, we can find a 
matrix E G Mn{K) such that C = [E, M]. Finally, we have 

/(M, M, E) = [M, [E, M]] + b[E, [M, M]] = [M, [E, M]] = [M, C] = A', 

as desired. □ 

Let us next describe the degree-three multilinear polynomials / satisfying /(M„(K)) = 
[M^iK),M^{K)]. 
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Lemma 14. Let n > 2 be an integer, K a field, and 

f{Xi,X2,X3) = ^ a„X„^i)X^(2)Xa(3) ^ K{xi,X2,X^) 

a multilinear polynomial of degree 3. Then f{Wln{K)) C [M.n{K),'M.n{K)] if and only if 
J2aes-i c^cr = = XlcrgAa '^CT' where C S3 is the alternating subgroup. 

Proof. By Corollary [Ht^l), we may assume that J^aes-s ~ ^- Now, suppose that Xlo-gAa ~ 
0. Then we must also have X]o-g53\A3 '^o- = J^aes^ '^o" ^ So-eAs '^c = 0. Therefore, 

/(xi,X2,a;3) = ^ aCTa;<T(i)a;a(2)a;a(3) + ^ a<7a;<7(i)a;a(2)a;a(3) 
o-eAs o-e53\A3 

= ^ a<7(a;<x{i)a;(7(2)a;a(3) - a;ia;2a;3) + ^ aa(a;a(i)a;a(2)a;<7{3) - a;2a;iX3) 
<7gA3\{i} ae53\(A3U{{i2)}) 

= 0(123) (a;2a;3a;i - 2:1X2X3) + 0(132) (a;3a;iX2 - X1X2X3) 

+0(i3)(3;3a^2a;i - 3:2X1X3) + 0.(23) (a;ia^3a^2 - 3:2X1X3) 

= 0(i23) [3^23^3,2:1] + 0(i32)[x3,XiX2] + 0(13) [X3, X2X1] + 0(23) [3:1X3, X2]. 

Thus, f{Mn{K)) C [Mn{K),Mn{K)]. 

On the other hand, /(-En, -E'12, -^21) = Oi^n + a(i23)-Eii + 0(i32)-E22 has trace Z]aeA3 
Hence, if this sum is not zero, then /(M„(is:)) ^ [M„(is:), M„(ir)]. □ 

Applying Theorem [13] to this lemma we obtain the following. 

Corollary 15. Let n > 2 be an integer, K a field with at least n elements, and 

/(Xi,X2,X3) = O^X^(i)X^(2)X<x(3) G K(xi,X2,X3) 

a nonzero multilinear polynomial of degree 3. Then /(M„(i^')) = [M„(i^'), M„(_ft')] if and 
only if 'Yliaes-i Oo- = = 'YliaeAs where C 5*3 is the alternating subgroup. 

We note that in general the condition J2aes o^- = = Xlo-eA does not characterize 
the nonzero multilinear polynomials /(xi, . . . , x.^) satisfying f{M.n{K)) = [M„(ir), M„(if)]. 
For example, if m = 2, then J2aeSm o^ = = ^^.^^^ o^- implies that /(xi,X2) = 0. Also, 
/(xi,X2,X3,X4) = X1X2X3X4-X4X3X2X1 satisfies Y^aes^ o^ = = J^aeA^ '^^^ but f{Mn{K)) ^ 
[M„(i^),M„(i^)] for any n > 2, since /(En, E12, ^22, ^21) = ^n ^ [M„(ir), M„(K)]. 

Let us conclude with a fact about the image of the degree-three multilinear polynomial 
[x, [y, z\\ on matrices over an arbitrary ring. We first require the following lemma. 

Lemma 16. Let R be a unital ring, let n > 2 be an integer, and let A = {aij) G M„(i?) be 
such that for each j G {0, 1, . . . , n — 1} we have Yl^=i = {i.e., sums of the elements 

along the diagonals above and including the main diagonal are 0). Then A = DX — XD for 
some D G M„(i?) and X = Y17=i where E^j are the matrix units. 
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Proof. Letting Z = X;r=i ^i^i+i have ZX = X;r=i = I-E^n- For any / G {0, 1, . . . 
1} and A; G {1, 2, . . . , n} we then have 



n- 



Ei+l^i \ A I Ei^i^i j Enn = Ek,k~lAE, 

1=1 / V j=l / 

iil<k, and EkkX^AZ^E^n = • AZ^Enn = if I > k. 
Letting D = ^"^'^ X'AZ'+\ we have 



n—Ln 



dk-Ln-lEkn 



n-2 



DX - XD = iJ2 ]^^~Y1 X'+'^AZ'+^ 



i=0 
n-l 



n-2 



i=0 
n-2 



A + ^ X'AZ' J (/ - Kn) - = ^ " ( 5Z ) ~ 

i=l ^ i=l ^ j=0 ^ 

(n— 2 \ ^ ra— 1 \ 

X'AZ' \ Enn - X^-'AE,n = ^ " ( X' AZ' I ^nn. 



Now, for every A; G {1, 2, . . . , n} we have 



(n— 1 \ k—1 / k—\ \ / ^ \ 

X^ AZ' \ Enn = EkkX^AZ'^Enn = ( (lk-i,n-i j Ekn = ( ^i,i+(n—k) 



by the computation in the first paragraph. Finally, the last sum is 0, by hypothesis on A, 
and hence i^^Zl X'AZ')Enn = 0, showing that DX - XD = A. □ 

Theorem 17. Let R be unital ring, let n > 2 be an integer, let f{x,y,z) = [x, [z,y]], and 



let A = {ttij) G M„(i?) be such that for each j G {0, 1, . . . , n — 1} we have ^11=1 
Then A = f{D, X, Y) for some D G M„(i?), X = ^^j/ and Y = Ztii^ ' ^)Eii. 

Proof. For any matrix M = {niij) G M„(i^r) we have 



[Y,M] 



0. 
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= {{i - l)mij) 


- ((j - 


1 




) = ((^-J 


)mij). 











Hence, in particular, [Y, X] = X. 

Now, by Lemma [TB| A= [D,X] for some D G M„(i?). We therefore have 

f{D,X,Y) = [D,[Y,X]] = [D,X] = A, 

proving the statement. □ 
This argument extends easily to all polynomials of the form [xi, [x2, . . . , Xm]] . . . ]. 
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